4 <110> APPLICANT: Schellenberger, Volker 

5 Liu, Amy D. 

6 Selifonova, Olga V. 

8 <120> TITLE OF INVENTION: Directed Evolution of Microorganisms 
11 <130> FILE REFERENCE: GC560-D1 

13 <140> CURRENT APPLICATION NUMBER: US 10/037,677A 

14 <141> CURRENT FILING DATE: 2001-10-23 

16 <150> PRIOR APPLICATION NUMBER: US 09/314,847 

17 <151> PRIOR FILING DATE: 1999-05-19 
19 <160> NUMBER OF SEQ ID NOS: 17 

21 <170> SOFTWARE: FastSEQ for Windows Version 4.0 

23 <210> SEQ ID NO: 1 

24 <211> LENGTH: 741 

25 <212> TYPE: DNA 

26 <213> ORGANISM: Escherichia coli 

28 <400> SEQUENCE: 1 

29 atgaccgcta tgagcactgc aattacacgc cagatcgttc tcgataccga aaccaccggt 60 

30 atgaaccaga ttggtgcgca ctatgaaggc cacaagatca ttgagattgg tgccgttgaa 120 

31 gtggtgaacc gtcgcctgac gggcaataac ttccatgttt atctcaaacc cgatcggctg 180 

32 gtggatccgg aagcctttgg cgtacatggt attgccgatg aatttttgct cgataagccc 240 

33 acgtttgccg aagtagccga tgagttcatg gactatattc gcggcgcgga gttggtgatc 300 

34 cataacgcag cgttcgatat cggctttatg gactacgagt tttcgttgct taagcgcgat 360 

35 attccgaaga ccaatacttt ctgtaaggtc accgatagcc ttgcggtggc gaggaaaatg 420 

36 tttcccggta agcgcaacag cctcgatgcg ttatgtgctc gctacgaaat agataacagt 480 

37 aaacgaacgc tgcacggggc attactcgat gcccagatcc ttgcggaagt ttatctggcg 540 

38 atgaccggtg gtcaaacgtc gatggctttt gcgatggaag gagagacaca acagcaacaa 600 

39 ggtgaagcaa caattcagcg cattgtacgt caggcaagta agttacgcgt tgtttttgcg 660 

40 acagatgaag agattgcagc tcatgaagcc cgtctcgatc tggtgcagaa gaaaggcgga 720 

41 agttgcctct ggcgagcata a 741 

43 <210> SEQ ID NO: 2 

44 <211> LENGTH: 246 

45 <212> TYPE: PRT 

46 <213> ORGANISM: Escherichia coli 

48 <400> SEQUENCE: 2 

49 Met Thr Ala Met Ser Thr Ala Ile Thr Arg Gln Ile Val Leu Asp Thr 

50 1 5 10 15 

51 Glu Thr Thr Gly Met Asn Gln Ile Gly Ala His Tyr Glu Gly His Lys 

52 20 25 30 

53 Ile Ile Glu Ile Gly Ala Val Glu Val Val Asn Arg Arg Leu Thr Gly 

54 35 40 45 

55 Asn Asn Phe His Val Tyr Leu Lys Pro Asp Arg Leu Val Asp Pro Glu 

56 50 55 60 

57 Ala Phe Gly Val His Gly Ile Ala Asp Glu Phe Leu Leu Asp Lys Pro 

58 65 70 75 80 

59 Thr Phe Ala Glu Val Ala Asp Glu Phe Met Asp Tyr Ile Arg Gly Ala 

60 85 90 95 

61 Glu Leu Val Ile His Asn Ala Ala Phe Asp Ile Gly Phe Met Asp Tyr 

62 100 105 110 

63 Glu Phe Ser Leu Leu Lys Arg Asp Ile Pro Lys Thr Asn Thr Phe Cys 

64 115 120 125 

65 Lys Val Thr Asp Ser Leu Ala Val Ala Arg Lys Met Phe Pro Gly Lys 

66 130 135 140 

67 Arg Asn Ser Leu Asp Ala Leu Cys Ala Arg Tyr Glu Ile Asp Asn Ser 

68 145 150 155 160 

69 Lys Arg Thr Leu His Gly Ala Leu Leu Asp Ala Gln Ile Leu Ala Glu 

70 165 170 175 

71 Val Tyr Leu Ala Met Thr Gly Gly Gln Thr Ser Met Ala Phe Ala Met 

72 180 185 190 

73 Glu Gly Glu Thr Gln Gln Gln Gln Gly Glu Ala Thr Ile Gln Arg Ile 

74 195 200 205 

75 Val Arg Gln Ala Ser Lys Leu Arg Val Val Phe Ala Thr Asp Glu Glu 

76 210 215 220 

77 Ile Ala Ala His Glu Ala Arg Leu Asp Leu Val Gln Lys Lys Gly Gly 

78 225 230 235 240 

79 Ser Cys Leu Trp Arg Ala 

80 245 

82 <210> SEQ ID NO: 3 

83 <211> LENGTH: 1164 

84 <212> TYPE: DNA 

85 <213> ORGANISM: Escherichia blattae 

87 <400> SEQUENCE: 3 

88 atgagctatc gtatgtttga ttatctggtt ccaaatgtga acttctttgg cccgggcgcc 60 

89 gtttctgttg ttggccagcg ctgccagctg ctggggggta aaaaagccct gctggtgacc 120 

90 gataagggcc tgcgcgccat taaagacggt gctgtcgatc agaccgtgaa gcacctgaaa 180 

91 gccgccggta ttgaggtggt cattttcgac ggggtcgagc cgaacccgaa agacaccaac 240 

92 gtgctcgacg gcctggccat gttccgtaaa gagcagtgcg acatgataat caccgtcggc 300 

93 ggcggcagcc cgcacgactg cggtaaaggc attggtattg cggccaccca cccgggtgat 360 

94 ctgtacagct atgccggtat cgaaacactc accaacccgc tgccgcccat tattgcggtc 420 

95 aacaccaccg ccgggaccgc cagcgaagtc acccgccact gcgtgctgac taacaccaaa 480 

96 accaaagtaa aatttgtgat tgtcagctgg cgcaacctgc cttccgtctc cattaacgat 540 

97 ccgctgctga tgatcggcaa gcccgccggg ctgaccgccg ccaccggtat ggatgccctg 600 

98 acccacgcgg tagaggccta tatctccaaa gacgccaacc cggttaccga tgcctctgct 660 

99 attcaggcca tcaaactgat tgccaccaac ttgcgccagg ccgtcgccct ggggaccaac 720 

100 ctcaaagccc gtgaaaacat ggcctgcgcc tctctgctgg ccgggatggc ctttaacaac 780 

101 gccaacctgg gctatgttca cgccatggct caccagctgg gcggcctgta cgacatggcc 840 

102 cacggggtgg cgaacgcggt cctgctgccc catgtctgcc gctataacct gattgccaac 900 

103 ccggaaaaat ttgccgatat cgccaccttt atgggggaaa acaccaccgg tctttccacc 960 

104 atggacgcag cggagctggc catcagcgcc attgcccgtc tgtctaaaga tgtcgggatc 1020 

105 ccgcagcacc tgcgtgaact gggggtaaaa gaggccgact tcccgtacat ggcagaaatg 1080 

106 gccctgaaag acggcaacgc cttctctaac ccgcgcaaag ggaacgaaaa agagattgcc 1140 

107 gacattttcc gccaggcatt ctga 1164 

109 <210> SEQ ID NO: 4 

110 <211> LENGTH: 387 

111 <212> TYPE: PRT 

112 <213> ORGANISM: Escherichia blattae 

114 <400> SEQUENCE: 4 

115 Met Ser Tyr Arg Met Phe Asp Tyr Leu Val Pro Asn Val Asn Phe Phe 

116 1 5 10 15 

117 Gly Pro Gly Ala Val Ser Val Val Gly Gln Arg Cys Gln Leu Leu Gly 

118 20 25 30 

119 Gly Lys Lys Ala Leu Leu Val Thr Asp Lys Gly Leu Arg Ala Ile Lys 

120 35 40 45 

121 Asp Gly Ala Val Asp Gln Thr Val Lys His Leu Lys Ala Ala Gly Ile 

122 50 55 60 

123 Glu Val Val Ile Phe Asp Gly Val Glu Pro Asn Pro Lys Asp Thr Asn 

124 65 70 75 80 

125 Val Leu Asp Gly Leu Ala Met Phe Arg Lys Glu Gln Cys Asp Met Ile 

126 85 90 95 

127 Ile Thr Val Gly Gly Gly Ser Pro His Asp Cys Gly Lys Gly Ile Gly 

128 100 105 110 

129 Ile Ala Ala Thr His Pro Gly Asp Leu Tyr Ser Tyr Ala Gly Ile Glu 

130 115 120 125 

131 Thr Leu Thr Asn Pro Leu Pro Pro Ile Ile Ala Val Asn Thr Thr Ala 

132 130 135 140 

133 Gly Thr Ala Ser Glu Val Thr Arg His Cys Val Leu Thr Asn Thr Lys 

134 145 150 155 160 

135 Thr Lys Val Lys Phe Val Ile Val Ser Trp Arg Asn Leu Pro Ser Val 

136 165 170 175 

137 Ser Ile Asn Asp Pro Leu Leu Met Ile Gly Lys Pro Ala Gly Leu Thr 

138 180 185 190 

139 Ala Ala Thr Gly Met Asp Ala Leu Thr His Ala Val Glu Ala Tyr Ile 

140 195 200 205 

141 Ser Lys Asp Ala Asn Pro Val Thr Asp Ala Ser Ala Ile Gln Ala Ile 

142 210 215 220 

143 Lys Leu Ile Ala Thr Asn Leu Arg Gln Ala Val Ala Leu Gly Thr Asn 

144 225 230 235 240 

145 Leu Lys Ala Arg Glu Asn Met Ala Cys Ala Ser Leu Leu Ala Gly Met 

146 245 250 255 

147 Ala Phe Asn Asn Ala Asn Leu Gly Tyr Val His Ala Met Ala His Gln 

148 260 265 270 

149 Leu Gly Gly Leu Tyr Asp Met Ala His Gly Val Ala Asn Ala Val Leu 

150 275 280 285 

151 Leu Pro His Val Cys Arg Tyr Asn Leu Ile Ala Asn Pro Glu Lys Phe 

152 290 295 300 

153 Ala Asp Ile Ala Thr Phe Met Gly Glu Asn Thr Thr Gly Leu Ser Thr 

154 305 310 315 320 

155 Met Asp Ala Ala Glu Leu Ala Ile Ser Ala Ile Ala Arg Leu Ser Lys 

156 325 330 335 

157 Asp Val Gly Ile Pro Gln His Leu Arg Glu Leu Gly Val Lys Glu Ala 

158 340 345 350 

159 Asp Phe Pro Tyr Met Ala Glu Met Ala Leu Lys Asp Gly Asn Ala Phe 

160 355 360 365 

161 Ser Asn Pro Arg Lys Gly Asn Glu Lys Glu Ile Ala Asp Ile Phe Arg 

162 370 375 380 

163 Gln Ala Phe 

164 385 

166 <210> SEQ ID NO: 5 

167 <211> LENGTH: 1164 

168 <212> TYPE: DNA 

169 <213> ORGANISM: Escherichia blattae 

171 <400> SEQUENCE: 5 

172 atgagctatc gtatgtttga ttatctggtt ccaaatgtra acttctttgg cccgggcgcc 60 

173 gtttctgttg ttggccagcg ctgccagctg ctggggggta aaaaagccct gctggtgacc 120 

174 gataagggcc tgcgcgccat taaagacggt gctgtcgatc agaccgtgaa gcacctgaaa 180 

175 gccgccggta ttgaggtggt cattttcgac ggggtcgagc cgaacccgaa agacaccaac 240 

176 gtgctcgacg gcctggccat gttccgtaaa gagcagtgcg acatgataat caccgtcggc 300 

177 ggcggcagcc cgctcgactg cggtaaaggc attggtattg cggccaccca cccgggtgat 360 

178 ctgtacagct atgccggtat cgaaacactc accaacccgc tgccgcccat tattgcggtc 420 

179 aacaccaccg ccgggaccgc cagcgaagtc acccgccact gcgtgctgac taacaccaaa 480 

180 accaaagtaa aatttgtgat tgtcagctgg cgcaacctgc cttccgtctc cattaacgat 540 

181 ccgctgctga tgatcggcaa gcccgccggg ctgaccgccg ccaccggtat ggatgccctg 600 

182 acccacgcgg tagaggccta tatctccaaa gacgccaacc cggttaccga tgcctctgct 660 

183 attcaggcca tcaaactgat tgccaccaac ttgcgccagg ccgtcgccct ggggaccaac 720 

184 ctcaaagccc gtgaaaacat ggcctgcgcc tctctgctgg ccgggatggc ctttaacaac 780 

185 gccaacctgg gctatgttca cgccatggct caccagctgg gcggcctgta cgacatggcc 840 

186 cacggggtgg cgaacgcggt cctgctgccc catgtctgcc gctataacct gattgccaac 900 

187 ccggaaaaat ttgccgatat cgccaccttt atgggggaaa acaccaccgg tctttccacc 960 

188 atggacgcag cggagctggc catcagcgcc attgcccgtc tgtctaaaga tgtcgggatc 1020 

189 ccgcagcacc tgcgtgaact gggggtaaaa gaggccgact tcccgtacat ggcagaaatg 1080 

190 gccctgaaag acggcaacgc cttctctaac ccgcgcaaag ggaacgaaaa agagattgcc 1140 

191 gacattttcc gccaggcatt ctga 1164 

193 <210> SEQ ID NO: 6 

194 <211> LENGTH: 387 

195 <212> TYPE: PRT 

196 <213> ORGANISM: Escherichia blattae 

198 <400> SEQUENCE: 6 

199 Met Ser Tyr Arg Met Phe Asp Tyr Leu Val Pro Asn Val Asn Phe Phe 

200 1 5 10 15 

201 Gly Pro Gly Ala Val Ser Val Val Gly Gln Arg Cys Gln Leu Leu Gly 

202 20 25 30 

203 Gly Lys Lys Ala Leu Leu Val Thr Asp Lys Gly Leu Arg Ala Ile Lys 

204 35 40 45 

205 Asp Gly Ala Val Asp Gln Thr Val Lys His Leu Lys Ala Ala Gly Ile 

206 50 55 60 

207 Glu Val Val Ile Phe Asp Gly Val Glu Pro Asn Pro Lys Asp Thr Asn 

208 65 70 75 80 

209 Val Leu Asp Gly Leu Ala Met Phe Arg Lys Glu Gln Cys Asp Met Ile 

210 85 90 95 

211 Ile Thr Val Gly Gly Gly Ser Pro Leu Asp Cys Gly Lys Gly Ile Gly 

212 100 105 110 

213 Ile Ala Ala Thr His Pro Gly Asp Leu Tyr Ser Tyr Ala Gly Ile Glu 

214 115 120 125 

215 Thr Leu Thr Asn Pro Leu Pro Pro Ile Ile Ala Val Asn Thr Thr Ala 

216 130 135 140 

217 Gly Thr Ala Ser Glu Val Thr Arg His Cys Val Leu Thr Asn Thr Lys 

218 145 150 155 160 

219 Thr Lys Val Lys Phe Val Ile Val Ser Trp Arg Asn Leu Pro Ser Val 

220 165 170 175 

221 Ser Ile Asn Asp Pro Leu Leu Met Ile Gly Lys Pro Ala Gly Leu Thr 

222 180 185 190 

223 Ala Ala Thr Gly Met Asp Ala Leu Thr His Ala Val Glu Ala Tyr Ile 

224 195 200 205 

225 Ser Lys Asp Ala Asn Pro Val Thr Asp Ala Ser Ala Ile Gln Ala Ile 

226 210 215 220 

227 Lys Leu Ile Ala Thr Asn Leu Arg Gln Ala Val Ala Leu Gly Thr Asn 

228 225 230 235 240 

229 Leu Lys Ala Arg Glu Asn Met Ala Cys Ala Ser Leu Leu Ala Gly Met 

230 245 250 255 

231 Ala Phe Asn Asn Ala Asn Leu Gly Tyr Val His Ala Met Ala His Gln 

232 260 265 270 

233 Leu Gly Gly Leu Tyr Asp Met Ala His Gly Val Ala Asn Ala Val Leu 

234 275 280 285 

235 Leu Pro His Val Cys Arg Tyr Asn Leu Ile Ala Asn Pro Glu Lys Phe 

236 290 295 300 

237 Ala Asp Ile Ala Thr Phe Met Gly Glu Asn Thr Thr Gly Leu Ser Thr 

238 305 310 315 320 

239 Met Asp Ala Ala Glu Leu Ala Ile Ser Ala Ile Ala Arg Leu Ser Lys 

240 325 330 335 

241 Asp Val Gly Ile Pro Gln His Leu Arg Glu Leu Gly Val Lys Glu Ala 

242 340 345 350 

243 Asp Phe Pro Tyr Met Ala Glu Met Ala Leu Lys Asp Gly Asn Ala Phe 

244 355 360 365 

245 Ser Asn Pro Arg Lys Gly Asn Glu Lys Glu Ile Ala Asp Ile Phe Arg 

246 370 375 380 

247 Gln Ala Phe 

248 385 

250 <210> SEQ ID NO: 7 

251 <211> LENGTH: 12 

252 <212> TYPE: DNA 

253 <213> ORGANISM: Artificial Sequence 

255 <220> FEATURE: 

256 <223> OTHER INFORMATION: wild type mutD gene 

258 <400> SEQUENCE: 7 

259 atgaccgcta tg 12 

261 <210> SEQ ID NO: 8 

262 <211> LENGTH: -11 

263 <212> TYPE: DNA 

264 <213> ORGANISM: Artificial Sequence 
266 <220> FEATURE: 